BEST AVAILABLE COPY 



(19) 



J 



EuropMisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(ID EP1 414 227 A1 

EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

28.04.2004 Bulletin 2004/18 

(21) Application number: 02354169.1 

(22) Date of filing: 24.10.2002 



(51) IntCI. 7 : H04M 3/56 



(84) Designated Contracting States: 


• Brandt, Marc 


AT BE BG CH CY CZ DE OK EE ES Fl FR GB GR 


38320 Eybens (FR) 


IE IT LI LU MC NL PT SE SK TR 


• Caradec, Jean-Philippe 


Designated Extension States: 


38180 Seyssins (FR) 


AL LT LV MK RO SI 






(74) Representative: Lloyd, Richard Graham (GB) 


(71) Applicant: Hewlett-Packard Company 


Hewlett-Packard France 


Palo Alto, CA 94304 (US) 


Intellectual Property Section 




Legal Department 


(72) Inventors: 


Etablissement de Grenoble 


• Sauvage, Pierre 


36053 Grenoble Cedex 09 (FR) 


38450 Notre Dame de Commlers (FR) 





(54) Event detection for multiple voice channel communications 



(57) Apparatus for controlling a communications 
system having a plurality of voice channels and a user 
terminal for receiving at least one of the voice channels 
comprising: a receiving element for receiving a plurality 
of the voice channels, a controller for identifying one of 



the voice channels to be monitored, an event detection 
element for detecting the presence of a predeterminable 
event in the identified voice channel, and an alert gen- 
erator for generating an alert when the predetermined 
event is detected. 
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Description 

[0001] The present invention relates generally to the 
field of telecommunications and more particularly to tel- 
ecommunications involving more than one voice chan- 
nel. 

[0002] Due to significant technology advances in re- 
cent years many telecommunications systems now en- 
able users to establish and control multiple voice chan- 
nel communications with relative ease. An example of 
a multiple voice channel communication service is call 
waiting, in which a calling party may establish separate 
connection paths with two or more called parties and 
may selectively switch between communicating with 
each party. Each of the connection paths provides a 
separate voice channel through which voice communi- 
cation may take place. With call waiting, since generally 
only one call can be active at any one time, there is what 
may be referred to as a 'foreground' voice channel for 
the current active call through which two-way commu- 
nication may take place, and a 'background' voice chan- 
nel for the current call on hold through which generally 
no communication may take place. 
[0003] The number of voice channels present, how- 
ever, is not necessarily linked to the number of active 
connections. For example, multiple voice channels may 
also exist in telephone-based audio conferencing sys- 
tems, even where only a single connection path is es- 
tablished between a caller and an audio conferencing 
service. In an audio conference it is usual for all parties 
to the conference to participate in a single voice channel 
through which all the parties may talk and listen to the 
other parties. It is also becoming increasingly common 
to enable subconferences to be established within an 
audio conference from a subset of the participants. A 
subconference typically allows the creation of an addi- 
tional and separate voice channel in which only parties 
to that subconference or voice channel may participate. 
Typically no audio signals are received from the main 
audio conference by participants of a subconference, 
however systems do now exist which enable voice sig- 
nals from a background voice channel to be mixed with 
audio signals from a foreground voice channel. Such 
systems, such as that described in US 6404873 to Bey- 
da et al., enable a user to hear voice signals from the 
main audio conference at the same time as participating 
in the subconference. 

[0004] However, problems can exist in such multiple 
voice channel environments due in part to the limited 
way in which users may control and manage how they 
receive voice signals from different voice channels. For 
example, if a system is arranged such that a user does 
not receive voice signals from a background voice chan- 
nel any information carried in that voice channel will be 
missed by the user. However, if a system is configured 
such that a user simultaneously receives voice signals 
from multiple voice channels there is an increased risk 
that information may be missed due to overloading of 
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human audible senses. Although, from a technical view- 
point, multiple voice channels may provide numerous 
benefits to users, users are currently not always able to 
take full advantage of these benefits due to physical hu- 
5 man constraints in coping with information coming from 
multiple sources simultaneously. 
. [0005] Accordingly, one aim of the present invention 
is to help alleviate at least some of the above-mentioned 
problems. 

w [0006] According to a first aspect of the present inven- 
tion, there is provided apparatus for managing a com- 
munications system having a plurality of voice channels 
and a user terminal for receiving at least one of the voice 
channels. The apparatus comprises a receiving element 

is for receiving a plurality of the voice channels, a control- 
ler for identifying one of the voice channels to be moni- 
tored, an event detection element for detecting the pres- 
ence of a predeterminable event in the identified voice 
channel, and an alert generator for generating an alert 

20 when the predeterminable event is detected. 

[0007] Advantageously this allows a user to be able 
to interact with a far greater number of simultaneous 
voice channels than is possible just using the human 
senses. By allowing selected voice channels to be mon- 

25 itored automatically a user can decide not to receive 
voice signals from these channels although can rely on 
the automatic monitoring of these channels to alert him 
to the presence of predeterminable events occurring 
within those channels. 

30 [0008] Preferably the controller is adapted to identify 
a voice channel in response to a request from the user 
terminal. 

[0009] The predeterminable event may be the occur- 
rence of a keyword in which case the event detection 
35 element may be adapted to detect the keyword through 
speech recognition. 

[0010] The predeterminable event may also be, for 
example, a silence period. 

[001 1] The controller may also be adapted for identi- 
40 fying a plurality of voice channels to be monitored and, 
in which case, the event detection element may be 
adapted for monitoring each selected voice channel for 
a different event. 

[0012] The alert generator may be adapted for trans- 
45 mitting an audible alert to the user terminal. In one em- 
bodiment an audible alert may be transmitted by mixing 
an audible alert with the at least one voice channel re- 
ceived by the user terminal, in a further embodiment the 
audible alert is preferably transmitted at a time when the 
50 audio level of the at least one voice channel received by 
the user terminal is below a predetermined threshold. 
The alert generator may alternatively be adapted for 
transmitting a signal to the user terminal to thereby 
cause the user terminal to generate a local alert. 
55 [0013] In a preferred embodiment events to be detect- 
ed are definable by the user of the user terminal. 
[001 4] The apparatus may further comprise a record- 
ing element to record a portion of the monitored voice 
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channel around the detected event. The alert generator 
may then be adapted for playing the recorded portion to 
the user. 

[0015] The apparatus may also comprise an element 
for automatically establish ing a voice channel with a pre- s 
determinable destination, in which case the controller 
may be adapted for selecting that voice channel for 
monitoring. 

[0016] According to a second aspect of the present 
invention, there is provided a method of managing a 
communications system having a plurality of voice 
channels and a user terminal for receiving at least one 
of the voice channels. The method comprises receiving 
a plurality of the voice channels, identifying one of the 
voice channels to be monitored, detecting the presence 
of a predeterminable event in the identified voice chan- 
nel, and generating an alert when the predetermined 
event is detected. 

[0017] Preferably the step of identifying a voice chan- 
nel is made in response to a request from the user ter- 
minal. 

[001 8] The step of identifying a voice channel may al- 
so be adapted for identifying a plurality of voice channels 
and, in which case, the step of detecting may be adapted 
for monitoring each selected voice channel for a differ- 
ent event. 

[0019] The step of generating an alert may comprise 
transmitting an audible alert to the user terminal. In one 
embodiment the step of generating an alert may com- 
prise mixing an audible alert with the at least one voice 
channel received by the user terminal. Preferably the 
alert is transmitted to the user terminal at a time when 
the audio level of the at least one voice channel received 
by the user terminal is below a predetermined threshold. 
Alternatively, the step of generating an alert may com- 
prise transmitting a signal to the user terminal to thereby 
cause the user terminal to generate a local alert. 
[0020] Preferably the step of detecting is performed 
by detecting user definable events. 
[0021] The method may also include automatically 
establishing a voice channel with a predeterminable 
destination and selecting that voice channel for moni- 
toring, 

[0022] According to a further aspect of the present in- 
vention there is provided a user terminal operating in 
accordance with the above-described method. 
[0023] According to a yet further aspect of the present 
invention there is provided apparatus for detecting 
speech in a telecommunications system having a plu- 
rality of voice channels and a user terminal for receiving 
at least one of the voice channels. The apparatus com- 
prises a receiving element for receiving a plurality of the 
voice channels, a controller for identifying one of the 
voice channels to be monitored, a speech recognition 
engine for detecting the presence of a predeterminable 
keyword in the identified voice channel, and an alert 
generator for generating an alert when the predetermi- 
nable keyword is detected. 



[0024] Various embodiments of the present invention 
will now be described, by way of example only, with ref- 
erence to the accompanying diagrams, in which: 

Figure 1 is a block diagram showing a system ac- 
cording to a first embodiment of the present inven- 
tion; 

Figure 2 is a block diagram showing the monitoring 
element of Figure 1 in greater detail; 
Figure 3 is a block diagram showing a further em- 
bodiment of the present invention; and 
Figure 4 is a block diagram illustrating a yet further 
embodiment of the present invention. 

[0025] Figure 1 is a block diagram showing a multiple 
voice channel system according to a first embodiment 
of the present invention. Figure 1 shows an audio con- 
ference system 106 which allows an audio conference 
call to be established between the user terminals 100, 
102 and 104. As is well known in the art, audio confer- 
ences may be established in many different ways, such 
as by using a dial-in or dial-out service, and such tech- 
niques will not be discussed further herein. 
[0026] As is also well known, a userterminal 1 00 may 
be used to establish a subconference within the main 
audio conference with, for example, the user terminal 
102. As is typical with such audio conference systems, 
once a subconference is created the userterminal 1 00 
may only communicate directly with the other members 
of the subconference. In prior art conferencing systems, 
whilst participating in a subconference, any information 
portrayed in the main conference would not be received 
by the userterminal 100, and hence would have been 
missed by a user. To help overcome this problem, a 
monitoring element 108 is provided as shown in Figure 
1. 

[0027] The monitoring element 108 acts to monitor 
selected voice channels and to provide an alert when 
any predeterminable voice tags or keywords are detect- 
ed therein. In an audio conference, for example, the 
monitoring element may be used to monitor the main 
audio conference whilst a user is participating in a sub- 
conference. One benefit of this is that it allows a user to 
better cope with multiple voice channel environments, 
and a user is no longer constrained by his own ability to 
monitor and to react to audible information from multiple 
sources. 

[0028] The monitoring element 1 08 is shown in great- 
er detail in Figure 2 and is described below. 
[0029] Figure 2 shows the audio conference system 
106 of Figure 1 which receives voice signals from each 
of the user terminals 100, 102 and 104 as previously 
described. The audio conference system 1 06 manages, 
controls and performs all the necessary functions to en- 
able audio conferences, subconferences and the like to 
be established, managed and controlled. 
[0030] In multiple voice channel environments a voice 
channel may appear differently to different users. For 
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example, a voice channel which appears as a fore- 
ground voice channel to one user may equally appear 
as a background voice channel to a different user. As 
previously mentioned, a foreground voice channel typi- 
cally allows two-way voice communication, whereas a 5 
background voice channel typically does not. For exam- 
ple, a party on-hold in a call waiting system (i.e. a back- 
ground voice channel) Is typically unable to communi- 
cate with the other party until the on-hold call is made 
the active call. The following description is considered 
from the point of view of the user terminal 1 00. 
[0031] Voice signals from each of the voice channels 
102 and 104 are input to a routing element 212 of the 
monitoring element 108. Under control of a controller 
218, the routing element may direct voice signals from 
any combination of the voice channels 102 and 104 to 
an automatic speech recognition (ASR) engine 21 4. For 
example, the controller may be configured to enable 
voice signals from a main audio conference to be mon- 
itored whilst the user terminal 100 is participating in a 
subconference. Through the controller 21 8 the ASR en- 
gine 214 may be configured to monitor a selected voice 
channel for the presence of one or more voice tags or 
keywords. A voice tag may comprise, for example, a 
word, a phrase, an utterance or any other identifiable 
sound. The ASR engine may, for example, be one of the 
many ASR engines currently on the market, as will be 
appreciated by those skilled in the art. Preferably the 
ASR engine is capable of analyzing continuous speech 
in one or more languages. 

[0032] Upon detection of a voice tag by the ASR en- 
gine 21 4 a signal is transmitted to an alert manager 21 6 
which is responsible for generating an appropriate alert. 
An alert may, for example, consist of an alert to a user 
of the user terminal 100, an alert to the user terminal 
100 itself, or even an alert to another user or another 
device as will be described below. 
[0033] The alert manager 216 may alert the user of 
the user terminal 100 in any number of ways. For exam- 
ple, the alert manager may cause an audible alert to be 
mixed with the voice signals sent from the audio confer- 
encing system 1 06 to the user terminal 1 00. An audible 
alert may include, amongst others, an audible tone, a 
spoken alert and a recording of a portion of the moni- 
tored voice channel. For example, it may be preferable 
to continually record, for instance in a circular buffer or 
recording element, the voice channel which is being 
monitored. Thereafter, if a keyword is detected within 
the voice channel, the alert may consist of playing to the 
user a few seconds of the recording occurring around 
the detection of the keyword so that user may better un- 
derstand the context of the detected keyword. 
[0034] An alert may also be non-audible and may, for 
example, cause the audio conference system 106 to 
switch the voice channel in which the keyword was de- 
tected to be the current foreground voice channel. For 
example, if a voice tag is detected in the main audio con- 
ference the alert may cause the user to leave a subcon- 



ference to rejoin the main audio conference. Such an 
alert may also be arranged to cause all participants of 
the subconference to rejoin the main audio conference. 
[0035] In a preferred embodiment, a spoken or whis- 
pered alert is given to the user during a suitable pause 
in the conversation, much in the way that a person might 
interrupt someone not in mid-flow, but at an appropriate 
break-point in the conversation. Such an interruption 
may be detected, for example, by determining the pres- 
ence of a silent gap, or a period when the audio level in 
the voice channel is below a predeterminable threshold. 
[0036] The alert manager may also cause an alert to 
be sent to the user terminal 100 itself. This may be, for 
example, using in-band signaling, or out-of-band sign- 
aling such as a short message (SMS) or Email mes- 
sage. Upon receipt of an alert the user terminal 1 00 may 
generate a local alert to the user of the terminal. For 
example, a local alert may include flashing a light, caus- 
ing the user terminal to vibrate or sounding an alarm 
within the user terminal 100. 

[0037] The alert manager may also cause an alert to 
be sent to an external device, such as a radio pager, 
mobile telephone, email account and so on. Such an 
alert may be sent in any appropriate format, such as 
SMS, Email and the like. 

[0038] Preferably the way in which the alert manager 
216 generates alerts is user definable, for example by 
storing a set of user preferences in the controller 21 8. 
[0039] For clarity of explanation the example de- 
scribed above in relation to Figure 2 only shows that a 
single voice channel may be monitored at one time. 
However, in a preferred embodiment the controller 218 
may be configured to allow multiple voice channels to 
be monitored simultaneously for the presence of a set 
of predeterminable voice tags. Additionally the control- 
ler 218 may be configured to monitor different voice 
channels for the presence of different sets of voice tags. 
[0040] Although only one ASR engine 2 1 4 is shown it 
will be appreciated that multiple ASR engines, or multi- 
ple instances ofthe ASR engine, may be implemented 
to enable more efficient monitoring of multiple voice 
channels for one or more sets of voice tags. For exam- 
ple, a different ASR engine may be used for each differ- 
ent voice channel to be monitored, each ASR engine 
being configured through the controller 21 8 to detect the 
required set of voice tags. 

[0041] Preferably the monitoring element 108 may be 
configured through the user terminal 100 using, for ex- 
ample, dual tone multi-frequency (DTMF) tones or voice 
commands. 

[0042] Preferably the voice tags which are to be de- 
tected are user definable by the user of the user terminal 
1 00 and may be stored, for example, in a storage device 
(not shown) such as a memory or a disk drive within the 
monitoring element 108. Voice tags may, for instance, 
be stored by speaking the desired keywords and having 
them recorded by the monitoring element 108. It may 
also be possible for one or more sets of user defined 
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voice tags to be stored in a user profile for use with the 
monitoring element. The user profile may be stored as 
part of the monitoring element 108, or may be stored 
externally to the monitoring element. When stored ex- 
ternally to the monitoring element, for example, on a In- s 
ternet-linked computer server, only a link or other loca- 
tion indicator to the user profile need be provided to en- 
able the user defined voice tags to be accessed. For 
example, it may be preferable for the user defined voice 
tags to be entered in text format via an Internet web 
page. 

[0043] For clarity the above-description is made from 
the point of view of the user terminal 100. It will be ap- 
preciated, however, that the monitoring element 1 08 
may also be implemented for each of the participants to 
the audio conference system, thereby allowing each of 
the participants to selectively monitor any voice channel 
which is available to them. 

[0044] Although the monitoring element 108 of Fig- 
ures 1 and 2 is illustrated as being a single element, it 
will be appreciated by those skilled in the art that the 
sub-systems 212, 214, 216 and 218 are not limited to 
being located within a single module or element, and 
one or more of these sub-systems may be remote from 
the others, for example, such as being distributed over 
a network. Such an embodiment is described below with 
reference to Figure 3. 

[0045] Figure 3 is a block diagram showing a further 
embodiment of the present invention, in which monitor- 
ing of one or more voice channels, in the manner gen- 
erally described above, may be provided, for example, 
through a voice service or media platform, such as the 
Hewlett-Packard OpenCall Media (OCMP) platform. For 
clarity of explanation only a simplified view of the media 
platform 314 is shown. 

[0046] Figure 3 shows a general telecommunications 
system 300 in which a user 302 may connect to the me- 
dia platform 31 4 through a telecommunications network 
304. The telecommunications network 304 may be, for 
example, an SS7 based PSTN, a voice over IP (VoIP) 
network, or any other suitable network. The media plat- 
form 31 4 may be connected to the network by a high 
capacity transmission link 31 2, such as an optical SON- 
ET link, capable of carrying thousands of simultaneous 
voice calls as will be appreciated by those skilled in the 
art. 

[0047] The media platform 31 4 enables the user 302 
to place additional calls, for example, to an audio 
streaming service 308, such as an audio share service 
providing details of share prices, and an audio confer- 
ence server 306. The media platform 314 comprises a 
mixing and routing element 318 which, in conjunction 
with a controller 31 6, manages the multiple connections 
and controls the appropriate mixing and routing of the 
available voice channels such that the user may control 
through which voice channels he wishes to communi- 
cate. The direction of the voice paths within the system 
300 is illustrated by the various dotted lines. For exam- 



ple, the audio streaming service 308 is shown as being 
a streaming only service with the audio path being 
shown as unidirectional from the streaming service 308 
to the media platform 31 4. Bi-directional audio paths are 
shown between media platform 314 and the conference 
server 306 and between the user 302 and the media 
platform 314. 

[0048] As previously described, the user 302 may 
configure the media platform, for example through the 
media platform, such that no audio signals from the au- 
dio streaming service are sent to the user, for example, 
whilst the user is participating in the audio conference 
provided by the audio conference server 306. To help 
ensure that any information relevant to the user por- 
trayed in the audio channel from the audio streaming 
service 308 is not missed, the user may configure the 
media platform to monitor this audio channel and to gen- 
erate an alert whenever a predeterminable event such 
as a keyword is detected therein. 
[0049] For example, in a configuration mode the user 
302 may define one or more keywords to be detected in 
a selected voice channel. The keyword(s) may, for ex- 
ample, be stored in an alert manager 320, which may in 
turn communicate the keyword(s) to an automatic 
speech recognition (ASR) engine 322. In the example 
shown the ASR engine 322 is remote from the media 
platform 314 and is connected via a link 324. Preferably 
the link 324 is a real-time protocol (RTP) link. Once the 
system is configured, the user may participate in the au- 
dio conference provided by the audio conference server 
306 in the normal manner. The mixing/routing element 
318 routes the voice channel from the audio streaming 
service 308 to the remote ASR engine 322 via the link 
324. Upon detection of any of the defined keywords by 
the ASR engine 322 a signal is sent to the alert manager 
320 which generates an appropriate alert, for example, 
in any previously described manner. 
[0050] Figure 4 is a block diagram illustrating a yet 
further embodiment in relation to a voice over IP (VoIP) 
system. A VoIP compatible user terminal 402 may con- 
nect to a number of other appropriate user terminals 404 
and 406 through an Internet protocol (IP) network 408. 
The user terminals may, for example, be Vol P telephone 
terminals or suitable equipped computer terminals. The 
user terminal 402 may be configured, in a generally 
known manner, to establish two separate voice chan- 
nels, for example a foreground voice channel 410 be- 
tween the user terminals 402 and 404, and a back- 
ground voice channel 412 between the user terminals 
402 and 406. The voice channels 41 0 and 41 2 are input 
to a monitoring element 414. The functionality provided 
by the monitoring element 414 may be similar in nature 
to that provided by the monitoring element 108 de- 
scribed above in relation to Figure 2. Generally such a 
monitoring element may be implemented in any location 
where access can be gained to the individual voice 
channels. 

[0051] In a further embodiment a monitoring element 
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as generally described above, if necessary in conjunc- 
tion with suitable telephony equipment, may be config- 
ured to automatically establish a call to a predetermined 
telephone number or destination whenever a telephone 
call is made, and to monitor that call or voice channel 5 
for one or more predeterminable voice tags. 
[0052] The present invention may be used, for exam- 
ple, to enable a telephone based share price information 
service to be monitored whilst in a separate telephone 
call. By configuring the monitoring element in an appro- to 
priate manner, for example, by defining the voice tags 
to be those of the company names whose shares are to 
be monitored, any relevant information relating to those 
companies will be alerted to the user. 
[0053] The present invention may also be used, for 15 
example, when a call is made to customer service call 
center. Often such calls are answered by an automated 
voice queuing application which informs the caller that 
the call will be answered as soon as an operator is avail- 
able. Such calls may involve long waits whilst waiting in 20 
the queue. By making appropriate use of the herein-de- 
scribed monitoring system the time normally wasted 
waiting for the call to be answered may be advanta- 
geously used, for example, to place an additional tele- 
phone call. The user can thereby await an appropriate 25 
alert from the monitoring element to indicate when the 
initial call has been answered. 
[0054] Those skilled in the art will appreciate that any 
type of voice channel may be monitored, whether the 
voice channel be a foreground voice channel, a back- 30 
ground voice channel or whatever. In a system in which 
multiple voice channels may be monitored there is pref- 
erably no restriction on the type of voice channels which 
may be monitored. 

[0055] It will also be appreciated that monitoring is not 35 
limited to monitoring a voice channel for the presence 
of a voice tag or keyword. For example, monitoring may 
be performed to detect the presence of any definable 
event, for example a silence period, a call being an- 
swered and so on. Depending on particular require- *o 
ments the system may be adapted to monitor voice 
channels for the presence of both voice tags and other 
definable events. Where a detectible event is a silence 
period this may be used, for example, in a situation 
wherein a user is participating in multiple conference 
calls at one time but is only listening to one of the con- 
ference calls. In this case, should a question be asked 
of the user in a conference call to which the user is not 
listening, the detection of a silence period may alert the 
user that a response is expected. It may, therefore, be so 
particularly useful to provide a replay of a recorded pe- 
riod of the audio signals occurring before the silence pe- 
riod was detected in order to help enable the user to 
regain the current context of the conference call. 
[0056] Other definable events may include, for exam- 55 
pie, semantic content of the voice channel being moni- 
tored, intonation-based triggers such as the detection of 
questions based on detecting appropriate intonation, 



speaker recognition and so on. 



Claims 

1. Apparatus for managing a communications sys- 
tem having a plurality of voice channels and a user 
terminal for receiving at least one of the voice chan- 
nels, comprising: 

a receiving element for receiving a plurality of 
the voice channels; 

a controller for identifying one of the voice 

channels to be monitored; 

an event detection element for detecting the 

presence of a predeterminable event in the 

identified voice channel; and 

an alert generator for generating an alert when 

the predetermined event is detected. 

2. The apparatus of claim 1 , wherein the controller 
is adapted to identify a voice channel in response 
to a request from the user terminal. 

3. The apparatus of claim 1 or 2, wherein the pre- 
determinable event is the occurrence of a keyword 
and wherein the event detection element is adapted 
to detect a keyword. 

4. The apparatus of claim 1 or 2, wherein the pre- 
determinable event is a silence period. 

5. The apparatus of claim 1 , 2 or 3, wherein the con- 
troller is adapted for identifying a plurality of voice 
channels to be monitored and wherein the event de- 
tection element is adapted for monitoring each se- 
lected voice channel for a different event. 

5. The apparatus of claim 1, 2, 3, 4 or 5, wherein 
the alert generator is adapted for transmitting an au- 
dible alert to the user terminal. 

6. The apparatus of claim 5, wherein the audible 
alert is transmitted by mixing an audible alert with 
the at least one voice channel received by the user 
terminal. 

7. The apparatus of claim 6, wherein the audible 
alert is transmitted at a time when the audio level of 
the at least one voice channel received by the user 
terminal is below a predetermined threshold. 

8. The apparatus of claim 1 , 2, 3, 4 or 5, wherein 
the alert generator is adapted for transmitting a sig- 
nal to the user terminal to thereby cause the user 
terminal to generate a local alert. 

9. The apparatus of any preceding claim, in which 
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the events to be detected are definable by the user 
of the user terminal. 

10. The apparatus of any preceding claim, further 
comprising a recording element to record a portion 5 
of the monitored voice channel around the detected 
event, and wherein the alert generator is adapted 

to playing the recorded portion to the user. 

11. The apparatus of any preceding claim, further 10 
comprising an element for automatically establish- 
ing a voice channel with a predeterminable destina- 
tion, and wherein the controller is adapted for se- 
lecting that voice channel for monitoring for a pre- 
determinable event. *s 

12. A method of managing a communications sys- 
tem having a plurality of voice channels and a user 
terminal for receiving at least one of the voice chan- 
nels comprising: 20 

receiving a plurality of the voice channels; 
identifying one of the voice channels to be mon- 
itored; 

detecting the presence of a predeterminable 25 
event in the identified voice channel; and 
generating an alert when the predeterminable 
event is detected. 

13. The method according to claim 12, wherein the 30 
step of identifying a voice channel is made in re- 
sponse to a request from the user terminal. 

1 4. The method according to claim 1 2 or 1 3, where- 
in the step of detecting is adapted for detecting the 35 
occurrence of a keyword. 

15. The method according to claim 12 or 13, where- 
in the step of detecting is adapted for detecting a 
silence period. *o 

16. The method according to claim 12, 13, 14 or 15, 
wherein the step of identifying a voice channel is 
adapted for identifying a plurality of voice channels 
and wherein the step of detecting is adapted for 45 
monitoring each selected voice channel for a differ- 
ent event. 



prising transmitting the alert to the user terminal at 
a time when the audio level of the at least one voice 
channel received by the user terminal is below a 
predeterminable threshold. 

20. The method according to any of claims 1 2 to 1 6, 
wherein the step of generating an alert further com- 
prises transmitting a signal to the user terminal to 
thereby cause the user terminal to generate a local 
alert. 

21 . The method according to any of claims 1 2 to 20, 
wherein the event to be detected is user-definable. 

22. The method of any of claims 12 to 21 , further 
comprising automatically establishing a voice chan- 
nel with a predeterminable destination, and select- 
ing that voice channel for monitoring for a predeter- 
minable event. 

23. A user terminal operating in accordance with the 
method of any of claims 12 to 21 . 

24. Apparatus for managing a telecommunications 
system having a plurality of voice channels and a 
user terminal for receiving at least one of the voice 
channels comprising: 

a receiving element for receiving a plurality of 
the voice channels; 

a controller for identifying one of the voice 

channels to be monitored; 

a speech recognition engine for detecting the 

presence of a predeterminable keyword in the 

identified voice channel; and 

an alert generator for generating an alert when 

the predeterminable keyword is detected. 



1 7. The method according to any of claims 1 2 to 1 6, 
wherein the step of generating an alert further com- so 
prises transmitting an audible alert to the user ter- 
minal. 

18. The method according to claim 17, further com- 
prising mixing an audible alert with the at least one 55 
voice channel received by the user terminal. 

1 9. The method according to claim 1 8, further com- 
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